Speech Enhancement Using Wavelet Coefficients Masking with Local Binary Patterns

Authors

  • Christian Arcos
  • Marley Vellasco
  • Abraham Alcaim
Abstract

This paper presents WLBP, a wavelet-coefficient masking approach based on Local Binary Patterns (LBP) that enhances the temporal spectra of the wavelet coefficients for speech enhancement. The technique builds on the wavelet denoising scheme, which splits the degraded speech into pyramidal subband components and extracts frequency information without losing temporal information. In each high-frequency subband, enhancement is performed with binary labels produced by local binary pattern masking, which encodes the ratio between the original value of each coefficient and the values of its neighbouring coefficients. The approach therefore enhances the high-frequency spectra of the wavelet transform instead of eliminating them through a threshold. A comparative analysis against conventional speech enhancement algorithms shows that the proposed technique achieves significant improvements in PESQ, an internationally recommended objective measure for estimating subjective speech quality. Informal listening tests also show that, in an acoustic context, the proposed method improves speech quality and avoids the annoying musical noise present in other speech enhancement techniques. Experimental results obtained with a DNN-based speech recognizer in noisy environments corroborate the superiority of the proposed scheme in the robust speech recognition scenario.

Keywords: binary labels, local binary patterns, mask, wavelet coefficients, speech enhancement, speech recognition.
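
The abstract does not spell out the exact masking rule, so the following is only a minimal sketch of the general idea, not the authors' WLBP algorithm. It assumes a single-level Haar transform in place of the pyramidal decomposition, a 1-D LBP that compares coefficient magnitudes within a small neighbourhood, and a hypothetical soft mask (`lbp_mask`, with illustrative parameters `radius` and `floor`) that attenuates detail coefficients rather than zeroing them.

```python
import numpy as np

def haar_dwt(x):
    """One-level orthonormal Haar DWT of an even-length 1-D signal."""
    x = np.asarray(x, dtype=float)
    approx = (x[0::2] + x[1::2]) / np.sqrt(2.0)
    detail = (x[0::2] - x[1::2]) / np.sqrt(2.0)
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    x = np.empty(2 * len(approx))
    x[0::2] = (approx + detail) / np.sqrt(2.0)
    x[1::2] = (approx - detail) / np.sqrt(2.0)
    return x

def lbp_labels(coeffs, radius=2):
    """1-D LBP bits: bits[k, i] is True when the k-th neighbour of
    coefficient i has a magnitude >= the magnitude of coefficient i."""
    mag = np.abs(np.asarray(coeffs, dtype=float))
    padded = np.pad(mag, radius, mode="edge")
    offsets = [o for o in range(-radius, radius + 1) if o != 0]
    n = len(mag)
    return np.stack([padded[radius + o: radius + o + n] >= mag for o in offsets])

def lbp_mask(coeffs, radius=2, floor=0.1):
    """Hypothetical soft mask (an assumption, not taken from the paper):
    a coefficient that dominates its neighbourhood gets gain 1, one that is
    dominated by all of its neighbours gets gain `floor`."""
    dominance = 1.0 - lbp_labels(coeffs, radius).mean(axis=0)
    return floor + (1.0 - floor) * dominance

def enhance(noisy, radius=2, floor=0.1):
    """Attenuate (never zero) the high-frequency subband, then reconstruct."""
    approx, detail = haar_dwt(noisy)
    return haar_idwt(approx, detail * lbp_mask(detail, radius, floor))
```

Because the gain never falls below `floor`, no detail coefficient is discarded outright, which mirrors the abstract's contrast with threshold-based elimination. A multi-level version would apply the same mask to the detail subband at each level of the decomposition.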

Similar resources

A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...

Wavelet Filter Bank Based Robust Speech Enhancement

L.M. Kadam, D.S. Aldar, and B.B. Godbole, K.B.P. College of Engineering and Polytechnic, Satara. The paper investigates a new speech enhancement scheme to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio....

Speech Enhancement using Statistical Estimators Based on Wavelet Transformations

This paper proposes a new technique: estimators for speech enhancement based on the wavelet transform. We propose a new set of estimators, called magnitude-squared spectrum estimators, that go beyond the conventional magnitude and power estimators using the wavelet transform. Maximum a posteriori (MAP) and minimum mean square error (MMSE) estimators are derived using hard masking, then soft masking ...

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

The quality of a speech signal degrades significantly in the presence of environmental noise, which leads to imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. This paper considers single-channel enhancement of speech signals corrupted by additive noise. A dictionary-based algorithm is proposed to train the speech...

Integrated speech enhancement and coding in the time-frequency domain

This paper addresses the problem of merging speech enhancement and coding in the context of auditory modeling. The noisy signal is first processed by a fast wavelet packet transform algorithm to obtain an auditory spectrum, from which a rough masking model is estimated. Then, this model is used to refine a subtractive-type enhancement algorithm. The enhanced speech coefficients are then encoded i...

Publication date: 2017